A Reinforcement Learning Model Equipped with Sensors for Generating Perception Patterns: Implementation of a Simulated Air Navigation System Using ADS-B (Automatic Dependent Surveillance-Broadcast) Technology
Abstract
Over the last few decades, a number of reinforcement learning techniques have emerged, and reinforcement learning-based applications have proliferated. However, such techniques tend to specialize in a particular field, which hinders their generalization and extrapolation to other areas. Moreover, neither the reward-punishment (r-p) learning process nor the convergence of results is fast and efficient enough. To address these obstacles, this research proposes a general reinforcement learning model that is independent of input and output types and is based on general bioinspired principles that help to speed up the learning process. The model is composed of a perception module based on sensors whose specific perceptions are mapped to perception patterns. In this manner, similar perceptions (even if perceived at different positions in the environment) are accounted for by the same perception pattern. Additionally, the model includes a procedure that statistically associates perception-action pattern pairs according to the positive or negative results of executing the respective action in response to a particular perception during the learning process. To rate these results, the model is fitted with a mechanism that reacts positively or negatively to particular sensory stimuli. The model is supplemented by an action module that can be configured according to the maneuverability of each specific agent. The model has been applied in the air navigation domain, a field with strong safety restrictions, which led us to implement a simulated system equipped with the proposed model. Accordingly, the perception sensors were based on Automatic Dependent Surveillance-Broadcast (ADS-B) technology, which is described in this paper. The results were quite satisfactory: the proposed model outperformed traditional methods from the literature with respect to learning reliability and efficiency.
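The abstract's two central ideas, position-independent perception patterns and a statistical association between perception-action pairs and their positive or negative outcomes, can be illustrated with a minimal sketch. This is not the paper's implementation: the grid-world setting, the names `to_pattern` and `PatternActionLearner`, the sensor radius, the exploration rate, and the Laplace-smoothed success ratio are all assumptions chosen only to make the idea concrete.

```python
import random
from collections import defaultdict


def to_pattern(agent_pos, obstacles, radius=1):
    """Map a raw perception to a pattern of obstacle offsets relative to the agent.

    Assumption: a 2D grid with a square sensor window. Because only relative
    offsets are kept, similar perceptions at different absolute positions
    yield the same pattern.
    """
    ax, ay = agent_pos
    return frozenset(
        (ox - ax, oy - ay)
        for ox, oy in obstacles
        if max(abs(ox - ax), abs(oy - ay)) <= radius
    )


class PatternActionLearner:
    """Hypothetical learner that counts outcomes per (pattern, action) pair."""

    def __init__(self, actions, seed=0):
        self.actions = list(actions)
        # (pattern, action) -> [positive_count, negative_count]
        self.counts = defaultdict(lambda: [0, 0])
        self.rng = random.Random(seed)

    def score(self, pattern, action):
        """Laplace-smoothed fraction of positive outcomes (an assumed rating rule)."""
        pos, neg = self.counts[(pattern, action)]
        return (pos + 1) / (pos + neg + 2)

    def choose(self, pattern, explore=0.1):
        """Pick a random action with probability `explore`, else the best-rated one."""
        if self.rng.random() < explore:
            return self.rng.choice(self.actions)
        return max(self.actions, key=lambda a: self.score(pattern, a))

    def update(self, pattern, action, positive):
        """Record one positive or negative result for this pair."""
        self.counts[(pattern, action)][0 if positive else 1] += 1
```

In this sketch the learner never sees absolute coordinates, so experience gathered near one obstacle transfers to any other place where the same relative layout is perceived. The action list passed to the constructor stands in for the configurable action module.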
Similar resources
Techniques to Provide Resilient Alternative Positioning, Navigation, and Timing (APNT) Using Automatic Dependent Surveillance - Broadcast (ADS-B) Ground Stations
The United States Federal Aviation Administration (FAA) Alternative Positioning, Navigation, and Timing (APNT) program is examining the use of existing FAA terrestrial infrastructure to provide navigation capable of continuing US National Airspace System (NAS) operations should Global Navigation Satellite System (GNSS) position, navigation, and timing (PNT) services be unavailable. The approxim...
Protection Method for Data Communication between ADS-B Sensor and Next-Generation Air Traffic Control Systems
Communications, Navigation, Surveillance/Air Traffic Management (CNS/ATM) systems utilize digital technologies, satellite systems, and various levels of automation to facilitate seamless global air traffic management. Automatic Dependent Surveillance-Broadcast (ADS-B), the core component of CNS/ATM, broadcasts important monitoring information, such as the location, altitude, and direction of ai...
Multi-Sensor Fusion with Interacting Multiple Model Filter for Improved Aircraft Position Accuracy
The International Civil Aviation Organization (ICAO) has decided to adopt Communications, Navigation, and Surveillance/Air Traffic Management (CNS/ATM) as the 21st century standard for navigation. Accordingly, ICAO members have provided an impetus to develop related technology and build sufficient infrastructure. For aviation surveillance with CNS/ATM, Ground-Based Augmentation System (GBAS), A...
Integrated Display and Simulation for Automatic Dependent Surveillance–Broadcast and Traffic Collision Avoidance System Data Fusion
Automatic Dependent Surveillance-Broadcast (ADS-B) is the direction of airspace surveillance development. Research analyzing the benefits of Traffic Collision Avoidance System (TCAS) and ADS-B data fusion is almost absent. The paper proposes an ADS-B minimum system from ADS-B In and ADS-B Out. In ADS-B In, a fusion model with a variable sampling Variational Bayesian-Interacting Multiple Model (...
ADS-B used in Improvement of Air Traffic Control
Wang, Hao. M.S.A.A., Purdue University, May 2015. ADS-B Used in Improvement of Air Traffic Control. Major Professor: Dengfeng Sun. Automatic dependent surveillance – broadcast (ADS–B) is a central component of the NextGen air traffic control program. This surveillance technology can replace current secondary surveillance radars (SSR) and improve cockpit situational awareness. The service can im...